Non-Parametric Statistical Techniques for Computational Forensic Engineering
نویسنده
چکیده
Computational forensic engineering is the process of identification of the tool or algorithm that was used to produce a particular output or solution by examining the structural properties of the output. We introduce a new Relative Generic Forensic Engineering (RGFE) technique that has several advantages over the previously proposed approaches. From the quantitative point of view, the new RGFE technique performs not only more accurate identification of the tool used but also provides the identification with a level of confidence. A higher degree of classification is achieved by our technique with the ability to identify the output as produced by an unknown tool. We introduce a generic formulation which enables rapid application of the RGFE approach to a variety of problems that can be formulated as 0-1 integer linear programs. Additionally, we present forensic engineering scenarios which enable a natural classification of the forensic engineering task with respect to the types and amount of information available to conduct the classification. From the technical point of view, the key innovations of the RGFE technique include the development of a simulated annealing-based CART classification and clustering technique and a generic property formulation technique which provides a systematic way to develop properties for a given problem or facilitates their reuse. In addition to solution properties, we introduce instance properties which enable an enhanced classification of problem instances leading to a higher accuracy of algorithm identification. Finally, the single most important innovation, property calibration, interprets the value for a given algorithm for a given property relative to the values for other algorithms. We demonstrated the RGFE technique on two canonical optimization problems: boolean satisfiability (SAT) and graph coloring (GC) and used statistical techniques to establish the effectiveness of the approach.
منابع مشابه
Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques
One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...
متن کاملA Non-parametric Control Chart for Controlling Variability Based on Squared Rank Test
Control charts are used to identify the presence of assignable cause of variation in the process. Non-parametric control chart is an emerging area of recent development in the theory of SPC. Its main advantage is that it does not require any knowledge about the underlying distribution of the variable. In this paper a non-parametric control chart for controlling variability has been developed. I...
متن کاملVery High Resolution Parametric and Non- Parametric Sartomography Methods for Monitoring Urban Areas Structures
Synthetic Aperture Radar (SAR) is the only way to evaluate deformation of the Earth’s surface from space on the order of centimeters and millimeters due to its coherent nature and short wavelengths. Hence, by this means the long term risk monitoring and security are performed as precisely as possible. Traditional SAR imaging delivers a projection of the 3-D object to the two dimensional (2-D) a...
متن کاملComparison of Gene Expression Programming (GEP) and Parametric and Non-parametric Regression Methods in the Prediction of the Mean Daily Discharge of Karun River (A case Study: Mollasani Hydrometric Station)
Nowadays, the prediction of river discharge is one of the important issues in hydrology and water resources; the results of daily river discharge pattern could be used in the management of water resources and hydraulic structures and flood prediction. In this research, Gene Expression Programming (GEP), parametric Linear Regression (LR), parametric Nonlinear Regression (NLR) and non-parametric ...
متن کاملOn the non-parametric multivariate control charts in fuzzy environment
Multivariate control chats are generally used in situations where the simultaneous monitoring or control of two or more related quality characteristics is necessary. In most processes in the real world, distribution of the process characteristics are unknown or at least non-normal, so the non-parametric or distribution-free charts are desirable. Most non-parametric statistical process-control t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003